Techniques for processing audible input directed to second device based on user looking at icon presented on display of first device

ABSTRACT

In one aspect, a first device may include at least one processor, a display accessible to the at least one processor, and storage accessible to the at least one processor. The storage may include instructions executable by the at least one processor to present an icon on the display so that the icon appears disposed within the real world at or proximate to a location of a second device associated with the icon. The second device may be different from the first device and exist in the real world. The instructions may also be executable to identify a user of the first device as looking at the icon, receive audible input from the user, and issue a command to the second device to execute at least one function in conformance with the audible input based on the identification of the user as looking at the icon.

FIELD

The present application relates to technically inventive, non-routine solutions that are necessarily rooted in computer technology and that produce concrete technical improvements.

BACKGROUND

Currently, to trigger a digital assistant to listen for voice input from a user to perform a task, utterance of a wake-up word or phrase is typically required to notify the digital assistant that ensuing words spoken by the user are meant to be provided as input. As recognized herein, this can create a host of problems. For instance, where multiple digital assistant-enabled devices exist in the same room, multiple different wake-up words might need to be remembered based on which associated device the user wishes to invoke. Furthermore, in situations where the user wishes to invoke a digital assistant while carrying on a conversation with another person, uttering the wake-up word or phrase interrupts the conversation. What's more, even in situations where digital assistants might not require utterance of a wake-up word to be triggered but multiple associated devices are present in a single room, a user's utterance of a command may be picked up by more than one digital assistant, leading to unintentional input to unintended devices. There are currently no adequate solutions to the foregoing computer-related, technological problem.

SUMMARY

Accordingly, in one aspect a first device includes at least one processor, a display accessible to the at least one processor, and storage accessible to the at least one processor. The storage includes instructions executable by the at least one processor to present an icon on the display so that the icon appears disposed within the real world at or proximate to a location of a second device associated with the icon. The second device is different from the first device and exists in the real world. The instructions are also executable to identify a user of the first device as looking at the icon, receive audible input from the user, and issue a command to the second device to execute at least one function in conformance with the audible input based on the identification of the user as looking at the icon.

Proximate to the location of the second device may include within a threshold non-zero distance to the second device, and/or above or next to the location of the second device at a same depth as the second device relative to the first device.

Additionally, in some examples the instructions may be executable to receive the audible input from the user while the user is identified as looking at the icon and then issue the command to the second device based on the audible input being received while the user is identified as looking at the icon. The user may be identified as looking at the icon based on input from at least one camera included on the first device.

Also in some examples, the instructions may be executable to identify the user of the first device as looking at the icon for at least a threshold non-zero amount of time, and then to issue the command to the second device based on the identification of the user as looking at the icon for at least the threshold non-zero amount of time.

Additionally, in some implementations the instructions may be executable to identify a selection of the icon by the user via user input other than the user looking at the icon. The instructions may then be executable to present a first graphical user interface (GUI) on the display based on the identification of the selection of the icon, with the first GUI including one or more graphical elements other than the icon that are related to the second device. Thus, the first GUI as presented on the display may include elements from a second GUI that the second device is configured to present on a display of the second device itself. Additionally or alternatively, the first GUI as presented on the display may include a hybrid of respective elements from second and third GUIs that the second device may be configured to present on the display of the second device itself, where the second and third GUIs are different from each other.

Also in some examples, the instructions may be executable to issue the command to the second device based on the identification of the user as looking at the icon without also receiving a wake up word from the user. Still further, note that the at least one function itself may relate to operation of hardware of the second device.

In another aspect, a method includes identifying a user of a first device as looking at one or more of a second device different from the first device, and a graphical element presented on a display of the first device and associated with the second device. The method also includes identifying audible input from the user and, based on the identifying of the user as looking at one or more of the second device and the graphical element, transmitting an indication to the second device to execute at least one function in conformance with the audible input.

In some examples, the method may include identifying the user as looking at the graphical element presented on the display of the first device, and the graphical element may be presented on the display so that the graphical element appears disposed within the real world at or proximate to a real world location of the second device. Proximate to the real world location of the second device may be within a threshold non-zero distance to the second device, and/or may be above or next to the location of the second device at a same real world depth as the second device relative to the first device.

Additionally, in some implementations the graphical element may be a first graphical element, the audible input may be first audible input, and the indication may be a first indication. In these implementations, the method may include presenting a second graphical element on the display of the first device, where the second graphical element may be associated with a third device different from the first device and different from the second device. The second graphical element may be different from the first graphical element and may be concurrently presented on the display with the first graphical element. The method may then include identifying the user of the first device as looking at the second graphical element, identifying second audible input from the user, and transmitting a second indication to the third device to execute at least one function in conformance with the second audible input based on the identification of the user as looking at the second graphical element.

Still further, in some examples the method may include presenting the graphical element on the display using augmented reality software and identifying the user as looking at the graphical element presented on the display of the first device.

Also in some examples, the method may include presenting the graphical element on the display responsive to determining that the second device itself is not visible to the user relative to the current location of the user, and then identifying the user as looking at the graphical element presented on the display of the first device.

In another aspect, a method includes presenting a graphical element on a display of a first device, with the graphical element associated with a second device other than the first device. The method also includes identifying a user of the first device as looking at the graphical element, identifying audible input from the user, and transmitting an indication to the second device to execute at least one function in conformance with the audible input based on the identification of the user as looking at the graphical element.

In some examples, the graphical element may be presented on the display so that the graphical element appears disposed within the real world at or proximate to a real world location of the second device. Proximate to the real world location of the second device may be within a threshold non-zero distance to the second device, and/or above or next to the location of the second device at a same real world depth as the second device relative to the first device.

In some examples, the graphical element may be a first graphical element, the audible input may be first audible input, and the indication may be a first indication. In these examples, the method may include presenting a second graphical element on the display of the first device, where the second graphical element may be associated with a third device different from the first device and different from the second device. The second graphical element may also be different from the first graphical element and may be concurrently presented on the display with the first graphical element. The method may then include identifying the user of the first device as looking at the second graphical element, identifying second audible input from the user, and transmitting a second indication to the third device to execute at least one function in conformance with the second audible input based on the identification of the user as looking at the second graphical element.

Additionally, in some implementations the method may include presenting the graphical element on the display using augmented reality (AR) software. Also in some implementations, the method may include presenting the graphical element on the display responsive to determining that the second device itself is not visible to the user relative to the current location of the user.

In another aspect, at least one computer readable storage medium (CRSM) that is not a transitory signal includes instructions executable by at least one processor to identify a user of a first device as looking at one or more of a second device different from the first device, and a graphical element presented on a display of the first device and associated with the second device. The instructions are also executable to identify audible input from the user and to transmit a notification to the second device to execute at least one function in conformance with the audible input based on the identification of the user as looking at one or more of the second device and the graphical element.

In some examples, the instructions may be executable to identify the user of the first device as looking at the graphical element presented on the display of the first device and to transmit the notification to the second device based on the identification of the user as looking at the graphical element. The graphical element may be presented on the display of the first device to appear at or proximate to the real world location of the second device.

The details of present principles, both as to their structure and operation, can best be understood in reference to the accompanying drawings, in which like reference numerals refer to like parts, and in which:

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of an example system consistent with present principles;

FIG. 2 is a block diagram of an example network of devices consistent with present principles;

FIG. 3 shows a top plan view of an augmented reality (AR) headset that may be used consistent with present principles;

FIG. 4 shows the perspective of a user of the AR headset while viewing the real world through the headset's transparent display consistent with present principles;

FIG. 5 shows a hybrid graphical user interface (GUI) that may be presented on the transparent display consistent with present principles;

FIGS. 6 and 7 are flow charts of example algorithms executable by a device consistent with present principles; and

FIG. 8 shows an example GUI for configuring one or more settings of a device that operates consistent with present principles.

DETAILED DESCRIPTION

The present application discloses systems and methods for people wearing augmented reality (AR) headsets such as computerized glasses (e.g., a Microsoft HoloLens, Google Glass, or Osterhout Design Group (ODG) glasses) to have icons virtually overlaid on their field of view, with the icons pertaining to other devices controllable using voice input. This might be useful for, e.g., doctors and nurses in a hospital ward. Thus, if a doctor or nurse's speech is meant to be directed toward a certain device as voice input, the doctor/nurse can fixate his or her gaze onto the specific virtual device icon associated with that device that is in his or her field of view while speaking (or fix his or her gaze as looking near or in the direction of the icon). Consequently, the AR headset and/or other device may use a digital assistant (e.g., Apple's Siri, Amazon's Alexa, Google's Assistant, etc.) to know that part of the speech is being directed towards it. Thus, medical staff members may interact with different devices in the ward (e.g., computational and medical) without interrupting the normal conversation flow with a patient and relatives in the same room. The other devices might include, e.g., an x-ray machine, an ultrasound machine, an EKG machine, a blood pressure machine, an oxygen level monitor, IV delivery equipment, etc.

For example, a nurse might look at an icon presented on the transparent display of his headset that is associated with an intravenous (IV) machine and utter, “I will start an IV line on you, and give you a 10 mg sleeping pill, called benzodiazepine, right now”. That in turn may be detected by the headset and the headset may then issue a command to the IV machine such as “automatically configure IV device to administer fluids now”.

As another example, a nurse might speak in natural language that “I am putting EKG probes on your chest, and we will monitor your heart for 10 minutes, okay” while gazing at a virtual icon for an associated electrocardiogram (EKG) machine. That in turn may be detected by the headset and the headset may then issue a command to the EKG machine such as “automatically set timer of EKG equipment for 10 minutes, begin timer and monitor”.

As still another example, a doctor might speak “So your fever started three days ago, accompanied with symptoms such as coughing and nausea” while gazing at a virtual icon associated with a computer that is separate from the doctor's headset. That in turn may be detected by the headset and the headset may then issue a command to the computer to input the related information into an electronic medical record (EMR) system (e.g., “fever for three days, coughing, nausea”).

In addition to or in lieu of the foregoing, note that in various embodiments the other machine may have its own microphone and digital assistant for processing voice input. In these embodiments, the headset may be used to determine which other machine the headset's wearer/user is looking at (or looking near or in the direction of) and then transmit an indication to that machine that any voice input that the other machine detects itself while the headset is indicating the user as looking at the other machine is to be processed by the machine.
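A minimal sketch of this variant follows, in which the headset only tells the gazed-at machine whether speech that the machine's own microphone hears should be treated as directed input. The Machine record, the message fields, and the send_indication() transport are hypothetical placeholders rather than any actual device protocol.

```python
import json
import time
from dataclasses import dataclass

@dataclass
class Machine:
    machine_id: str   # e.g., "iv-pump-408" (hypothetical identifier)
    address: tuple    # (host, port) the machine listens on, assumed for this sketch

def send_indication(machine: Machine, payload: dict) -> None:
    # Stand-in transport: a real headset might use Wi-Fi, Bluetooth, or a facility network API.
    print(f"-> {machine.machine_id} @ {machine.address}: {json.dumps(payload)}")

def notify_gaze_target(machine: Machine, looking: bool) -> None:
    # Tell the machine whether speech it detects right now should be processed as directed input.
    send_indication(machine, {
        "type": "gaze_state",
        "process_local_audio": looking,   # True while the wearer is looking at/near this machine
        "timestamp": time.time(),
    })

# Example: the wearer starts and then stops looking at the IV machine.
iv_machine = Machine("iv-pump-408", ("10.0.0.42", 9000))
notify_gaze_target(iv_machine, looking=True)   # machine's own assistant should accept speech
notify_gaze_target(iv_machine, looking=False)  # machine should ignore ambient speech again
```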

Furthermore, note that present principles may be used even where the physical machine itself might not be visible to the user from the user's current field of view. For example and again in the medical context, the machine might be located behind the back of another doctor/nurse or in a closed cabinet or under the patient's bed.

For interaction with a machine for a relatively short period of time, gaze-while-talking may be used where any utterances the headset wearer/user makes while looking at an icon may get directed to the associated machine itself. But, in some examples for interaction for a relatively longer period of time with a given machine, the user's gaze can be used to lock the headset on a certain icon for any utterances spoken by the user subsequent to that to be directed to the associated machine even if the user stops looking at the associated icon while providing the utterance(s). The other machine may then be released/unlocked after the user's interaction is determined to be complete, after a threshold amount of time passes during which no audible input is detected, and/or upon an unlock command being received from the user.
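One way to model the lock/release behavior just described is a small state tracker such as the following sketch; the dwell threshold, the idle timeout, and the method names are illustrative assumptions rather than values fixed by the disclosure.

```python
class GazeLock:
    """Tracks whether voice input is currently 'locked' to one machine."""

    def __init__(self, lock_dwell_s=5.0, idle_timeout_s=30.0):
        self.lock_dwell_s = lock_dwell_s      # stare this long at one icon to lock
        self.idle_timeout_s = idle_timeout_s  # release if no audible input for this long
        self.locked_machine = None
        self._dwell_target = None
        self._dwell_start = None
        self._last_audio = None

    def on_gaze(self, machine_id, now):
        # Called each eye-tracking frame with the machine/icon currently gazed at (or None).
        if machine_id != self._dwell_target:
            self._dwell_target, self._dwell_start = machine_id, now
        elif (machine_id is not None and self.locked_machine is None
              and now - self._dwell_start >= self.lock_dwell_s):
            self.locked_machine = machine_id  # later utterances route here even if gaze moves

    def on_audible_input(self, now):
        self._last_audio = now  # any detected speech resets the idle-release timer

    def maybe_release(self, now, unlock_command=False):
        idle = self._last_audio is not None and now - self._last_audio >= self.idle_timeout_s
        if unlock_command or idle:
            self.locked_machine = None

# Example usage with synthetic timestamps (seconds).
lock = GazeLock()
lock.on_gaze("iv-pump-408", now=0.0)
lock.on_gaze("iv-pump-408", now=5.1)   # stared past the dwell threshold -> locked
print(lock.locked_machine)             # "iv-pump-408"
lock.on_audible_input(now=6.0)
lock.maybe_release(now=40.0)           # no speech for idle_timeout_s -> released
print(lock.locked_machine)             # None
```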

Further still, the user of the headset may select an icon presented on the headset's transparent display using input other than eye input. This may be used as a command to the headset for the headset to present more details of a user interface of the machine. The user may then gaze at different parts of that user interface to provide further user input for even more interaction.

Prior to delving further into the details of the instant techniques, note that present principles may apply in other contexts besides health care. For example, present principles may be applied to devices/machines within a residential household, a lab, a warehouse, etc.

Also note with respect to any computer systems discussed herein that a system may include server and client components, connected over a network such that data may be exchanged between the client and server components. The client components may include one or more computing devices including televisions (e.g., smart TVs, Internet-enabled TVs), computers such as desktops, laptops and tablet computers, so-called convertible devices (e.g., having a tablet configuration and laptop configuration), and other mobile devices including smart phones. These client devices may employ, as non-limiting examples, operating systems from Apple Inc. of Cupertino, Calif., Google Inc. of Mountain View, Calif., or Microsoft Corp. of Redmond, Wash. A Unix® or similar operating system such as Linux® may be used. These operating systems can execute one or more browsers such as a browser made by Microsoft or Google or Mozilla or another browser program that can access web pages and applications hosted by Internet servers over a network such as the Internet, a local intranet, or a virtual private network.

As used herein, instructions refer to computer-implemented steps for processing information in the system. Instructions can be implemented in software, firmware or hardware, or combinations thereof and include any type of programmed step undertaken by components of the system; hence, illustrative components, blocks, modules, circuits, and steps are sometimes set forth in terms of their functionality.

A processor may be any general purpose single- or multi-chip processor that can execute logic by means of various lines such as address lines, data lines, and control lines and registers and shift registers. Moreover, any logical blocks, modules, and circuits described herein can be implemented or performed with a general purpose processor, a digital signal processor (DSP), a field programmable gate array (FPGA) or other programmable logic device such as an application specific integrated circuit (ASIC), discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A processor can also be implemented by a controller or state machine or a combination of computing devices. Thus, the methods herein may be implemented as software instructions executed by a processor, suitably configured application specific integrated circuits (ASIC) or field programmable gate array (FPGA) modules, or any other convenient manner as would be appreciated by those skilled in the art. Where employed, the software instructions may also be embodied in a non-transitory device that is being vended and/or provided that is not a transitory, propagating signal and/or a signal per se (such as a hard disk drive, CD ROM or Flash drive). The software code instructions may also be downloaded over the Internet. Accordingly, it is to be understood that although a software application for undertaking present principles may be vended with a device such as the system 100 described below, such an application may also be downloaded from a server to a device over a network such as the Internet.

Software modules and/or applications described by way of flow charts and/or user interfaces herein can include various sub-routines, procedures, etc. Without limiting the disclosure, logic stated to be executed by a particular module can be redistributed to other software modules and/or combined together in a single module and/or made available in a shareable library.

Logic, when implemented in software, can be written in an appropriate language such as but not limited to C# or C++, and can be stored on or transmitted through a computer-readable storage medium (that is not a transitory, propagating signal per se) such as a random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), compact disk read-only memory (CD-ROM) or other optical disk storage such as digital versatile disc (DVD), magnetic disk storage or other magnetic storage devices including removable thumb drives, etc.

In an example, a processor can access information over its input lines from data storage, such as the computer readable storage medium, and/or the processor can access information wirelessly from an Internet server by activating a wireless transceiver to send and receive data. Data typically is converted from analog signals to digital by circuitry between the antenna and the registers of the processor when being received and from digital to analog when being transmitted. The processor then processes the data through its shift registers to output calculated data on output lines, for presentation of the calculated data on the device.

Components included in one embodiment can be used in other embodiments in any appropriate combination. For example, any of the various components described herein and/or depicted in the Figures may be combined, interchanged or excluded from other embodiments.

“A system having at least one of A, B, and C” (likewise “a system having at least one of A, B, or C” and “a system having at least one of A, B, C”) includes systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.

The term “circuit” or “circuitry” may be used in the summary, description, and/or claims. As is well known in the art, the term “circuitry” includes all levels of available integration, e.g., from discrete logic circuits to the highest level of circuit integration such as VLSI, and includes programmable logic components programmed to perform the functions of an embodiment as well as general-purpose or special-purpose processors programmed with instructions to perform those functions.

Now specifically in reference to FIG. 1, an example block diagram of an information handling system and/or computer system 100 is shown that is understood to have a housing for the components described below. Note that in some embodiments the system 100 may be a desktop computer system, such as one of the ThinkCentre® or ThinkPad® series of personal computers sold by Lenovo (US) Inc. of Morrisville, N.C., or a workstation computer, such as the ThinkStation®, which are sold by Lenovo (US) Inc. of Morrisville, N.C.; however, as apparent from the description herein, a client device, a server or other machine in accordance with present principles may include other features or only some of the features of the system 100. Also, the system 100 may be, e.g., a game console such as XBOX®, and/or the system 100 may include a mobile communication device such as a mobile telephone, notebook computer, and/or other portable computerized device.

As shown in FIG. 1, the system 100 may include a so-called chipset 110. A chipset refers to a group of integrated circuits, or chips, that are designed to work together. Chipsets are usually marketed as a single product (e.g., consider chipsets marketed under the brands INTEL®, AMD®, etc.).

In the example of FIG. 1, the chipset 110 has a particular architecture, which may vary to some extent depending on brand or manufacturer. The architecture of the chipset 110 includes a core and memory control group 120 and an I/O controller hub 150 that exchange information (e.g., data, signals, commands, etc.) via, for example, a direct management interface or direct media interface (DMI) 142 or a link controller 144. In the example of FIG. 1, the DMI 142 is a chip-to-chip interface (sometimes referred to as being a link between a “northbridge” and a “southbridge”).

The core and memory control group 120 include one or more processors 122 (e.g., single core or multi-core, etc.) and a memory controller hub 126 that exchange information via a front side bus (FSB) 124. As described herein, various components of the core and memory control group 120 may be integrated onto a single processor die, for example, to make a chip that supplants the “northbridge” style architecture.

The memory controller hub 126 interfaces with memory 140. For example, the memory controller hub 126 may provide support for DDR SDRAM memory (e.g., DDR, DDR2, DDR3, etc.). In general, the memory 140 is a type of random-access memory (RAM). It is often referred to as “system memory.”

The memory controller hub 126 can further include a low-voltage differential signaling interface (LVDS) 132. The LVDS 132 may be a so-called LVDS Display Interface (LDI) for support of a display device 192 (e.g., a CRT, a flat panel, a projector, a touch-enabled light emitting diode display or other video display, etc.). A block 138 includes some examples of technologies that may be supported via the LVDS interface 132 (e.g., serial digital video, HDMI/DVI, display port). The memory controller hub 126 also includes one or more PCI-express interfaces (PCI-E) 134, for example, for support of discrete graphics 136. Discrete graphics using a PCI-E interface has become an alternative approach to an accelerated graphics port (AGP). For example, the memory controller hub 126 may include a 16-lane (x16) PCI-E port for an external PCI-E-based graphics card (including, e.g., one or more GPUs). An example system may include AGP or PCI-E for support of graphics.

In examples in which it is used, the I/O hub controller 150 can include a variety of interfaces. The example of FIG. 1 includes a SATA interface 151, one or more PCI-E interfaces 152 (optionally one or more legacy PCI interfaces), one or more USB interfaces 153, a LAN interface 154 (more generally a network interface for wired and/or wireless communication over at least one network such as the Internet, a WAN, a LAN, etc. under direction of the processor(s) 122), a general purpose I/O interface (GPIO) 155, a low-pin count (LPC) interface 170, a power management interface 161, a clock generator interface 162, an audio interface 163 (e.g., for speakers 194 to output audio), a total cost of operation (TCO) interface 164, a system management bus interface (e.g., a multi-master serial computer bus interface) 165, and a serial peripheral flash memory/controller interface (SPI Flash) 166, which, in the example of FIG. 1, includes BIOS 168 and boot code 190. With respect to network connections, the I/O hub controller 150 may include integrated gigabit Ethernet controller lines multiplexed with a PCI-E interface port. Other network features may operate independent of a PCI-E interface.

The interfaces of the I/O hub controller 150 may provide for communication with various devices, networks, etc. For example, where used, the SATA interface 151 provides for reading, writing or reading and writing information on one or more drives 180 such as HDDs, SDDs or a combination thereof, but in any case the drives 180 are understood to be, e.g., tangible computer readable storage mediums that are not transitory, propagating signals. The I/O hub controller 150 may also include an advanced host controller interface (AHCI) to support one or more drives 180. The PCI-E interface 152 allows for wireless connections 182 to devices, networks, etc. The USB interface 153 provides for input devices 184 such as keyboards (KB), mice and various other devices (e.g., cameras, phones, storage, media players, etc.).

In the example of FIG. 1, the LPC interface 170 provides for use of one or more ASICs 171, a trusted platform module (TPM) 172, a super I/O 173, a firmware hub 174, BIOS support 175 as well as various types of memory 176 such as ROM 177, Flash 178, and non-volatile RAM (NVRAM) 179. With respect to the TPM 172, this module may be in the form of a chip that can be used to authenticate software and hardware devices. For example, a TPM may be capable of performing platform authentication and may be used to verify that a system seeking access is the expected system.

The system 100, upon power on, may be configured to execute boot code 190 for the BIOS 168, as stored within the SPI Flash 166, and thereafter processes data under the control of one or more operating systems and application software (e.g., stored in system memory 140). An operating system may be stored in any of a variety of locations and accessed, for example, according to instructions of the BIOS 168.

The system 100 may further include an audio receiver/microphone 195 that provides input from the microphone 195 to the processor 122 based on audio that is detected, such as via a user providing audible input to the microphone 195 consistent with present principles. Still further, the system 100 may include a camera 193 that gathers one or more images and provides input related thereto to the processor 122. The camera 193 may be a thermal imaging camera, an infrared (IR) camera, a digital camera such as a webcam, a three-dimensional (3D) camera, and/or a camera otherwise integrated into the system 100 and controllable by the processor 122 to gather pictures/images and/or video.

Additionally, though not shown for simplicity, in some embodiments the system 100 may include a gyroscope that senses and/or measures the orientation of the system 100 and provides input related thereto to the processor 122, as well as an accelerometer that senses acceleration and/or movement of the system 100 and provides input related thereto to the processor 122. Also, the system 100 may include a GPS transceiver that is configured to communicate with at least one satellite to receive/identify geographic position information and provide the geographic position information to the processor 122. However, it is to be understood that another suitable position receiver other than a GPS receiver may be used in accordance with present principles to determine the location of the system 100.

It is to be understood that an example client device or other machine/computer may include fewer or more features than shown on the system 100 of FIG. 1. In any case, it is to be understood at least based on the foregoing that the system 100 is configured to undertake present principles.

Turning now to FIG. 2, example devices are shown communicating over a network 200 such as the Internet in accordance with present principles. It is to be understood that each of the devices described in reference to FIG. 2 may include at least some of the features, components, and/or elements of the system 100 described above. Indeed, any of the devices disclosed herein may include at least some of the features, components, and/or elements of the system 100 described above.

FIG. 2 shows a notebook computer and/or convertible computer 202, a desktop computer 204, a wearable device 206 such as a smart watch, a smart television (TV) 208, a smart phone 210, a tablet computer 212, a headset 216, and a server 214 such as an Internet server that may provide cloud storage accessible to the devices 202-212, 216. It is to be understood that the devices 202-216 are configured to communicate with each other over the network 200 to undertake present principles.

Now describing FIG. 3, it shows a top plan view of a headset, such as the headset 216, consistent with present principles. The headset 216 may include a housing 300, at least one processor 302 in the housing, and a transparent “heads up” display 304 accessible to the at least one processor and coupled to the housing. Additionally, the headset 216 may include storage 308 accessible to the processor 302 and coupled to the housing 300, as well as one or more cameras 310, 312 accessible to the processor 302 and coupled to the housing 300 for use as disclosed herein. Thus, the cameras 310, 312 may be oriented to face away from the headset 216 in the front-facing direction in which a user's head would be oriented when wearing the headset 216. The headset 216 may also include one or more cameras 314 oriented inward to image the user's eyes while the user wears the headset 216 for eye tracking consistent with present principles.

Still further, note that the headset 216 may include still other components not shown for simplicity, such as a network interface for communicating over a network such as the Internet and a battery for powering components of the headset 216. Additionally, note that while the headset 216 is illustrated as an augmented reality (AR) headset, the headset 216 may also be established by another type of headset such as a virtual reality (VR) headset that may not have a transparent display but may still be able to present electronic content such as virtual AR icons along with a real-world, real-time camera feed of an environment imaged by one or more of the cameras 310, 312 to provide an AR experience to the user. Also note that electronic contact lenses with their own respective heads up displays may also be used consistent with present principles.

Now describing FIG. 4, it shows a perspective 400 of a user while wearing a headset having a transparent display 402 through which the real world may be viewed. In this example, the wearer/user of the headset is disposed within a hospital room and, relative to the perspective 400 shown, a patient 404 is partially shown as lying on a hospital bed 406 while hooked up to an intravenous (IV) machine/device 408 for the machine 408 to administer fluids and medicine to the patient 404 according to a prescribed flow rate. The prescribed flow rate itself may be represented on an electronic display 410 of the machine 408.

Based on the user's headset receiving camera images from a camera on the headset that faces outward away from the headset to mimic the actual view of the user through the display 402, the headset may execute object recognition to identify and track the location of the machine 408 relative to the headset as the user moves. The camera images may also be used to identify a television 412 mounted on a wall within the room.

Additionally, though not actually visible to the user or headset itself from their current locations, the headset may identify an electrocardiograph (EKG) machine 414 that is located within a cabinet 416 that has its doors closed to obstruct any view of the machine 414. Owing to the headset's camera being unable to capture images of the machine 414 since it is not visible from outside the cabinet 416, the headset may identify the location of the machine 414 in other ways. For example, the machine 414 may communicate its GPS coordinates to the headset, the headset may already be preprogrammed with the location of the machine 414, and/or the headset may know the location of the machine 414 based on previous simultaneous localization and mapping (SLAM) performed in conjunction with another headset while the doors of the cabinet 416 were open and the machine 414 visible. Still further, beacon signals may be received from the machine 414 that indicate its location. Additionally, a quick response (QR) code (or label with a machine identity number) associated with the machine 414 and disposed on an exterior surface of the cabinet 416 and visible to the camera may be identified using object recognition to thus identify the location of the machine 414 as being in the container (cabinet 416) on which the QR code is disposed. Signal triangulation and received signal strength indicator algorithms may also be used to identify the location of the machine 414 based on wireless signals received therefrom.
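The several location sources just listed can be thought of as a fallback chain. The sketch below illustrates that idea only; each lookup function is a hypothetical stub that a real headset would back with GPS reports, a preprogrammed map, shared SLAM data, beacon/RSSI measurements, or a QR label.

```python
from typing import Optional, Tuple

Location = Tuple[float, float, float]  # x, y, z in the headset's world frame (assumed)

# Hypothetical stubs; each returns a Location if its source can resolve the machine, else None.
def from_device_gps(machine_id: str) -> Optional[Location]: ...
def from_preprogrammed_map(machine_id: str) -> Optional[Location]: ...
def from_shared_slam(machine_id: str) -> Optional[Location]: ...
def from_beacon_or_rssi(machine_id: str) -> Optional[Location]: ...
def from_qr_label(machine_id: str) -> Optional[Location]: ...

def resolve_location(machine_id: str) -> Optional[Location]:
    # Try each source in turn and return the first position that resolves.
    for source in (from_device_gps, from_preprogrammed_map, from_shared_slam,
                   from_beacon_or_rssi, from_qr_label):
        location = source(machine_id)
        if location is not None:
            return location
    return None  # location unknown; no icon can be anchored for this machine yet

print(resolve_location("ekg-414"))  # None here, since all sources above are stubs
```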

Once the machines/devices 408, 412, and 414 and their respective locations have been identified through one method or another, associated icons 418, 420, and 422 (or other graphical elements such as thumbnail images of the machines themselves) may be presented on the display 402. As shown, the icon 418 may show fluid in a container to denote the IV machine 408, the icon 420 may show a television set to denote the television 412, and the icon 422 may be an oval with the text “EKG” within it to denote the EKG machine 414.

The icons 418, 420, and 422 may be presented on the display 402 using augmented reality software so that the icons 418, 420, 422 appear to the user to be located not on the display 402 itself but proximate to or at the real-world locations of the respective machines 408, 412, and 414. For example, icons 418 and 420 may respectively be presented a threshold non-zero distance above the respective real-world locations of the respective machines 408, 412 themselves (e.g., three inches above) in the X-Y plane and at a same depth as the machines 408, 412 relative to the current location of the headset itself. However, further note that icons associated with respective machines may also be presented to appear at other locations within the threshold distance of the respective machines, such as next to the associated machines (e.g., to the left or right) and/or below the associated machines.

Still further, in some examples an associated icon for a given device/machine may be presented on the display 402 using augmented reality software so that it appears to the user as actually disposed at the current real-world location of the associated device/machine itself in all three dimensions. An example of this is shown via the icon 422, which is presented via the display 402 to appear as though disposed at the actual real-world location of the machine 414 within the cabinet 416. In some examples, an associated icon being presented as though disposed at the actual location of the associated machine (rather than proximate to it) may only occur when the headset determines that the associated machine itself is not currently visible to the user, and otherwise icons may be presented to appear proximate to but not at the current real world locations of visible machines. However, in other examples an associated icon may be presented as though disposed at the actual location of the associated machine even if the associated machine is visible based on user preference, configurations set by a system administrator, etc.
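For illustration only, the following sketch shows one way the placement rule above might be computed: visible machines get an icon offset slightly above them at the same depth, hidden machines get the icon at their own position, and a simple pinhole projection maps the anchor point to display coordinates. The focal lengths, principal point, and 0.076 m offset are assumptions; actual placement would come from the headset's AR runtime.

```python
FX = FY = 500.0          # assumed focal lengths in pixels
CX, CY = 640.0, 360.0    # assumed principal point for a 1280x720 display

def icon_anchor(machine_xyz, is_visible):
    # Visible machines get an icon slightly above them at the same depth; hidden machines
    # (e.g., the EKG machine inside the closed cabinet) get the icon at their own position.
    x, y, z = machine_xyz
    return (x, y + 0.076, z) if is_visible else (x, y, z)   # 0.076 m is roughly three inches

def project_to_display(point_xyz):
    # Simple pinhole projection of a headset-frame point onto display pixel coordinates.
    x, y, z = point_xyz
    if z <= 0:
        return None  # behind the viewer; nothing to draw
    return (FX * x / z + CX, CY - FY * y / z)

print(project_to_display(icon_anchor((0.3, -0.2, 2.0), is_visible=True)))    # IV machine, visible
print(project_to_display(icon_anchor((-0.8, -0.5, 3.5), is_visible=False)))  # EKG machine, hidden
```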

As also shown in FIG. 4, in some examples the headset may present instructions 424 on the display 402. The instructions 424 may include text indicating that the user should stare at a particular icon and speak voice input in order for the voice input to be wirelessly routed to a machine associated with the icon for processing or execution of a command/function in conformance with the voice input.

The instructions 424 may also include text indicating that the user should stare at a particular icon for a threshold amount of time (e.g., five seconds) to “lock” voice input as being routed to the associated machine. “Locking” voice input to the associated machine may be used so that the user may speak voice input that will be routed to the associated machine for processing even if the user diverts his or her gaze away from staring at the particular icon while speaking (or even speaking after looking away and pausing). The user may then “unlock” voice input from always being routed to the associated machine by again staring at the associated icon for the same or a different threshold period of time, and/or by blinking a predetermined number of times (e.g., twice) within a threshold period of time when again gazing at the associated icon. The user may also “unlock” voice input by staring for a threshold amount of time at a release selector 426 presented on the display 402, and/or selecting the selector 426 another way such as via voice input or an “air tap” where the user gestures to touch the associated icon where it appears to be located in the real world (as tracked by the headset's outward-facing camera(s)).

Note that where the user is gazing and for how long may itself be determined based on execution of eye tracking software using images from one or more cameras on the headset that are oriented inward to image the user's eyes.
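A simple way to turn an eye-tracked gaze direction into “which icon is the user looking at” is an angular hit test, sketched below. The angular tolerance and the input format are assumptions rather than the interface of any particular eye-tracking system; dwell timing and locking can then be layered on top, as in the lock tracker sketched earlier.

```python
import math

def angle_between(v1, v2):
    # Angle (radians) between two 3-D vectors, clamped to avoid rounding errors in acos().
    dot = sum(a * b for a, b in zip(v1, v2))
    n1 = math.sqrt(sum(a * a for a in v1))
    n2 = math.sqrt(sum(b * b for b in v2))
    return math.acos(max(-1.0, min(1.0, dot / (n1 * n2))))

def gazed_icon(gaze_dir, icons, tolerance_rad=math.radians(3.0)):
    # icons: {machine_id: (x, y, z) position of that machine's icon in the headset frame}.
    # Returns the machine whose icon lies closest to the gaze ray, if within the tolerance.
    best_id, best_angle = None, tolerance_rad
    for machine_id, position in icons.items():
        angle = angle_between(gaze_dir, position)
        if angle < best_angle:
            best_id, best_angle = machine_id, angle
    return best_id

# Example: gaze pointed roughly toward the IV machine's icon.
icons = {"iv-pump-408": (0.3, 0.0, 2.0), "tv-412": (-1.0, 0.5, 3.0)}
print(gazed_icon((0.15, 0.0, 1.0), icons))  # "iv-pump-408"
```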

Then once the user has been identified as looking at a given icon for directing voice input thereto, and/or once the user “locks” voice input as being routed to a given machine, additional graphical elements 428 may be presented to visually highlight the selected icon and associated machine. In this example, the icon 420 for the associated machine 412 has been selected and arrows 428 have been presented to highlight as much.

It is to also be understood consistent with present principles that in some embodiments the respective icons themselves that are associated with various devices/machines may be further selectable to provide a command to the headset for a detailed graphical user interface (GUI) such as the GUI 500 of FIG. 5 to be presented on the display 402. The icons may be selected for such purposes via air tap, voice input, gazing constantly at a given icon until a graphical hourglass 430 empties its sand to a lower chamber, etc. The icons may also be selected using a cursor under control of a mouse or track pad, based on selection of a push button on the housing of the headset itself, and/or based on selection of a push button or soft button on another input device in wired or wireless communication with the headset (e.g., a remote control or smart phone).

Once an icon is selected according to the paragraph above, one or more GUIs presentable on a display of the associated machine itself may be presented on the display 402 of the headset, e.g., based on wireless communication of the GUI(s) to the headset from the associated machine. As shown in FIG. 5, two GUIs 502, 504 that are separately presentable on the display 410 of the IV machine 408 at different times (e.g., time “T1” and time “T2” as shown) may be combined into the single hybrid GUI 500 presented on the display 402 of the headset. This might occur even if the GUIs 502, 504 are not concurrently presentable together on the display 410 owing to a limited amount of display space on which to present the GUIs 502, 504, owing to the GUIs 502, 504 otherwise being presented only upon separate user commands, etc. Also, note that the GUI 500 may be updated in real time as the machine 408 would otherwise update the GUIs 502, 504 based on its operation, administration of fluids, user commands, updates to data related to the vital signs 503 of the patient 404, etc.

As also shown in FIG. 5, the hybrid GUI 500 may include some or all of the same graphical elements as presented on the respective GUIs 502, 504 themselves (e.g., same selectors, prompts, and/or data representations). For example, the GUI 500 may include the warning from the GUI 502 that a desired dose of prescription fluids exceeds a common medical limit, along with a yes selector 506 from the GUI 502 that is selectable to administer the fluids anyway and a no selector 508 from the GUI 502 to decline to do so. The selector 510 of the GUI 504 to change the flow rate of fluids administered via the machine 408 may also be presented on the combined GUI 500 of FIG. 5. The same can be said for the start selector 512, stop selector 514, and menu selector 516 of the GUI 504 that are respectively selectable to command the machine 408 to start administration of the fluids, stop administration of the fluids, or present a menu GUI associated with the machine 408. The respective selectors themselves as presented on the GUI 500 may be selectable using any of the methods described herein, such as via air tap, voice command, selection using a cursor, staring for a threshold period of time, etc.
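The hybrid GUI 500 can be viewed as a de-duplicated union of the element lists of the GUIs 502, 504. The sketch below shows that merge with invented element descriptions; a real machine would transmit its GUIs in whatever description format it actually uses.

```python
def merge_guis(*guis):
    # Keep each element once (by id), preserving the order in which GUIs and elements arrive.
    seen, hybrid = set(), []
    for gui in guis:
        for element in gui:
            if element["id"] not in seen:
                seen.add(element["id"])
                hybrid.append(element)
    return hybrid

gui_502 = [  # warning GUI shown on the IV machine's display at time T1 (illustrative content)
    {"id": "dose-warning", "kind": "prompt", "text": "Desired dose exceeds common limit"},
    {"id": "yes-506", "kind": "selector", "text": "Yes, administer"},
    {"id": "no-508", "kind": "selector", "text": "No"},
]
gui_504 = [  # flow-rate GUI shown at time T2 (illustrative content)
    {"id": "flow-rate-510", "kind": "selector", "text": "Change flow rate"},
    {"id": "start-512", "kind": "selector", "text": "Start"},
    {"id": "stop-514", "kind": "selector", "text": "Stop"},
    {"id": "menu-516", "kind": "selector", "text": "Menu"},
]
hybrid_500 = merge_guis(gui_502, gui_504)
print([e["id"] for e in hybrid_500])
```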

Now referring to FIG. 6, it shows example logic that may be executed by a device such as the system 100 and/or a headset with a transparent display in accordance with present principles. For example, the logic of FIG. 6 may be executed by the headset described above in reference to FIGS. 4 and 5. Also note that in some examples the logic of FIG. 6 may be executed in part by a remotely-located server in communication with the headset.

Beginning at block 600, the device may track its position using augmented reality software, a SLAM algorithm, image registration and computer vision, etc. Also at block 600, the device may use augmented reality software and/or stereoscopic images to present icons for associated devices/machines located in the real world that are in the same room or area as the headset consistent with present principles. The icons may be presented on the device's transparent display so that they appear at least proximate to the real-world location of the associated machines. From block 600 the logic may then proceed to block 602.

At block 602 the device may execute an eye tracking algorithm to track the user's line of sight based on camera input that indicates the direction and depth of the user's gaze. From block 602 the logic may then proceed to block 604 where the device may receive input from a microphone on the device and identify the user as speaking audible input based on the input from the microphone. For example, the device may use voice recognition to determine that the user has potentially provided a command or other voice input to the device that the device is to route to another machine such as, e.g., an IV machine within a hospital room for execution of a function by the IV machine.

Then to determine whether the user has in fact provided voice input or is merely speaking words that were not meant to be provided as input, the logic may move to decision diamond 606 where the device may determine whether the user, while speaking, is looking at an icon presented on the device's transparent display that is associated with another machine. In some but not all examples, the user must be determined to be looking at the icon for at least a first threshold non-zero amount of time (e.g., two seconds) to avoid false positives due to the user quickly glancing at the icon while speaking words that were not meant to be provided as input.

Additionally, note consistent with present principles that in some examples the user may also look at the associated machine itself as disposed in the real world in order to provide input to it through speaking (rather than looking at an associated icon that might be presented on the device's transparent display). In these examples, the determination at diamond 606 may therefore be based on whether the user is looking at the machine itself while speaking.

In either case, a negative determination at diamond 606 may cause the logic to revert back to block 600 and proceed therefrom. However, an affirmative determination at diamond 606 that the user is in fact looking at an icon (or associated machine) may instead cause the logic to proceed to block 608. At block 608 the device may, without receiving a wake up word/phrase at any point, issue a command or notification to the associated machine to execute a function in conformance with the audible input from the user. A digital assistant executing at the machine itself may then be used to execute the function indicated by the device. The function may include adjusting operation of hardware of the machine (e.g., adjusting the rate at which an IV machine administers fluids, turning on sensors for different vital signs that the machine is to monitor, etc.), audibly reporting data to the user, presenting a certain menu on the machine's display, etc.
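The decision at diamond 606 and the command at block 608 may be summarized as the routing rule sketched below, which requires a sustained gaze (or an existing lock) and involves no wake-up word. The helper names, the two-second default, and the issue_command() transport are placeholders for illustration, not the actual implementation.

```python
FIRST_THRESHOLD_S = 2.0  # configurable look-time to avoid false positives (see FIG. 8)

def issue_command(machine_id, utterance):
    # Placeholder transport: the real device would send a command/notification to the machine.
    print(f"command -> {machine_id}: execute function for {utterance!r}")

def route_utterance(utterance, gaze_target, gaze_dwell_s, locked_machine=None):
    # A locked machine (FIG. 6 block 612) takes priority; otherwise require a sustained gaze
    # at the machine's icon (or the machine itself) at the time the utterance is detected.
    if locked_machine is not None:
        issue_command(locked_machine, utterance)
        return locked_machine
    if gaze_target is not None and gaze_dwell_s >= FIRST_THRESHOLD_S:
        issue_command(gaze_target, utterance)
        return gaze_target
    return None  # treated as ordinary conversation, not device input

route_utterance("set the timer for 10 minutes", gaze_target="ekg-414", gaze_dwell_s=2.4)
route_utterance("how was your weekend", gaze_target=None, gaze_dwell_s=0.0)
```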

Also at block 608, in some examples the device may transmit a command to still other machines in the same room or area to not execute any function in conformance with the audible input, e.g., if digital assistants executing at the other machines would otherwise pick up on the audible input and attempt to execute a command accordingly.

From block 608 the device may then proceed to decision diamond 610. At diamond 610 the device may determine whether the user is staring at the associated icon for a second threshold non-zero amount of time that may be longer than the first threshold non-zero amount of time described above in reference to diamond 606. Staring for the second threshold non-zero amount of time may be done in order to lock voice input as being directed to the associated machine as described above.

A negative determination at diamond 610 may cause the logic to revert back to block 600 and proceed therefrom. However, an affirmative determination at diamond 610 may instead cause the logic to proceed to block 612. At block 612 the device may continue to route commands or notifications to the machine to execute functions in conformance with additional audible input that might be received regardless of whether the user continues to look at the associated icon or not. The logic of FIG. 6 may then proceed to decision diamond 614.

At diamond 614 the device may determine whether to unlock voice input from being routed to the associated machine consistent with the description above. A negative determination at diamond 614 may cause the logic to continue making the determination at diamond 614 until such time as an affirmative determination is made. Then upon an affirmative determination being made at diamond 614 the logic may proceed to block 616. At block 616 the logic may revert back to block 600 and proceed therefrom.

Now describing FIG. 7, it also shows example logic that may be executed by the same device that executes the logic of FIG. 6 (e.g., an AR headset) in order to present a hybrid GUI at the device consistent with present principles. The logic of FIG. 7 may therefore be executed in conjunction with the logic of FIG. 6.

Beginning at diamond 700, the device may determine whether the user has selected an icon presented on the device's transparent display. An icon may be selected using any of the non-staring methods described above in reference to FIGS. 4 and 5, for example. A negative determination at diamond 700 may cause the logic to continue making the determination at diamond 700 until such time as an affirmative determination is made. Then upon an affirmative determination being made at diamond 700 the logic may proceed to block 702.

At block 702 the device may communicate with another machine/device associated with the icon to receive and/or merge GUIs of the associated machine for presentation of a hybrid GUI using the device's transparent display consistent with present principles. The logic may then move to block 704 where the device may use augmented reality software to actually present the hybrid GUI on the device's transparent display.
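For illustration, the FIG. 7 flow might look roughly like the following sketch, where fetch_guis() and render_hybrid() stand in for the machine's wireless GUI transfer and the AR rendering layer, respectively; neither name nor the GUI description format comes from the disclosure.

```python
def fetch_guis(machine_id):
    # Placeholder: in practice the associated machine would transmit its GUI descriptions wirelessly.
    return [
        [{"id": "dose-warning", "kind": "prompt"}, {"id": "yes-506", "kind": "selector"}],
        [{"id": "start-512", "kind": "selector"}, {"id": "stop-514", "kind": "selector"}],
    ]

def render_hybrid(machine_id, elements):
    # Placeholder for handing the merged element list to the AR rendering layer.
    print(f"presenting hybrid GUI for {machine_id}: {[e['id'] for e in elements]}")

def on_icon_selected(machine_id):
    guis = fetch_guis(machine_id)                        # block 702: receive the machine's GUIs
    hybrid, seen = [], set()
    for element in (e for gui in guis for e in gui):     # block 702: merge, keeping each id once
        if element["id"] not in seen:
            seen.add(element["id"])
            hybrid.append(element)
    render_hybrid(machine_id, hybrid)                    # block 704: present via AR on the display

on_icon_selected("iv-pump-408")
```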

Continuing the detailed description in reference to FIG. 8, it shows an example GUI 800 that may be presented on the display of a device undertaking present principles, such as a transparent display of an AR headset and/or the display of a smart phone communicating with the AR headset to undertake present principles. It is to be understood that each of the options to be discussed below in reference to FIG. 8 may be selected by selecting the respective check box shown adjacent to the respective option.

As shown in FIG. 8, the GUI 800 may include a first option 802 that may be selectable to enable or set the device to route voice input to other machines based on a user of the device being identified as looking at respective icons while providing the voice input. For example, the option 802 may be selected to enable the device to undertake the operations discussed above in reference to FIGS. 4 and 5 as well as to execute the logic of FIGS. 6 and 7.

The GUI 800 may also include an option 804 that may be selectable to enable or set the device to only present icons for machines that are not visible to the user from the user's current location and perspective. Thus, should the option 804 be selected, the device may be configured to only present icons for machines that are not currently visible to the user, but the user may still direct input to other machines that are visible but do not have associated icons presented on the device's transparent display simply by looking at the associated machine itself.
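The effect of option 804 can be pictured as a simple filter over the known machines, as in the sketch below; is_visible_to_user() is a placeholder for whatever line-of-sight or occlusion test the headset actually performs.

```python
def is_visible_to_user(machine_id):
    # Placeholder visibility test; here the EKG machine is hidden inside the closed cabinet 416.
    hidden = {"ekg-414"}
    return machine_id not in hidden

def machines_to_show_icons_for(machine_ids, only_hidden_machines):
    if not only_hidden_machines:
        return list(machine_ids)   # default: present icons for every known machine
    return [m for m in machine_ids if not is_visible_to_user(m)]

print(machines_to_show_icons_for(["iv-pump-408", "tv-412", "ekg-414"], only_hidden_machines=True))
# -> ['ekg-414']; visible machines can still be addressed by looking at the machine itself
```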

The example GUI 800 of FIG. 8 may also include settings 806 and 808. The setting 806 may be controlled to configure the threshold amount of time used at diamond 606 as described above to avoid false positives due to the user quickly glancing at an icon while speaking words that were not meant to be provided as input. Accordingly, an input box 810 may be presented as part of the GUI 800 for the user to provide numerical input to establish that threshold amount of time.

The setting 808 may be controlled to configure the threshold amount of time used at diamond 610 to lock voice input as being directed to an associated machine. Accordingly, an input box 812 may be presented as part of the GUI 800 for the user to provide numerical input to establish that threshold amount of time.

As also shown in FIG. 8, the GUI 800 may include an option 814 that may be selectable to configure the device to merge different GUIs presentable at a different machine at different times as disclosed herein. Thus, selection of the option 814 may configure the device to merge different GUIs as disclosed herein. In certain examples where the option 814 is not selected, the device may instead simply present the same single GUI on its transparent display that is currently presented on the display of the other machine itself.
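Taken together, the choices on GUI 800 amount to a handful of device settings. The sketch below stores them in one structure; the field names and default values are invented for illustration and simply echo the example thresholds mentioned above.

```python
from dataclasses import dataclass

@dataclass
class GazeVoiceSettings:
    route_by_gaze_enabled: bool = True      # option 802: route voice input per gaze at icons
    icons_only_for_hidden: bool = False     # option 804: icons only for machines not visible
    glance_threshold_s: float = 2.0         # setting 806 / input box 810: used at diamond 606
    lock_threshold_s: float = 5.0           # setting 808 / input box 812: used at diamond 610
    merge_guis: bool = True                 # option 814: present hybrid GUIs vs. a single GUI

settings = GazeVoiceSettings(glance_threshold_s=1.5)
print(settings)
```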

Before concluding, it is to be understood that the functions described above as being performed by a headset may also be undertaken by devices other than headsets, such as smart phones or tablet computers having their own camera feeds of environments that may be presented on their own respective touch-enabled displays along with the icons and other graphical elements disclosed herein. For instance, a smart phone or other computing device might also undertake the logic of FIGS. 6 and 7 and present an icon associated with another device on its display, even if the smart phone's display is not transparent, by using augmented reality software to still present the icon on its display as if disposed at the real-world location of the other device.

Additionally, note that a first device (e.g., the smart phone in the preceding sentence) may identify a user as looking at a second device for identifying audible input to route to the second device consistent with present principles even if the first device does not have a display at all (e.g., if the first device is a head-mounted camera system not including a display). Present principles may also apply in instances where a system remote from the user routes audible input to another device based on identification of the user as looking at the other device using input from cameras remote from the user but still within the user's environment to track the user's line of sight.

It may now be appreciated that present principles provide for an improved computer-based user interface that improves the functionality and ease of use of the devices disclosed herein. The disclosed concepts are rooted in computer technology for computers to carry out their functions.

It is to be understood that whilst present principles have been described with reference to some example embodiments, these are not intended to be limiting, and that various alternative arrangements may be used to implement the subject matter claimed herein. Components included in one embodiment can be used in other embodiments in any appropriate combination. For example, any of the various components described herein and/or depicted in the Figures may be combined, interchanged or excluded from other embodiments.

What is claimed is:
1. A first device, comprising: at least one processor; a display accessible to the at least one processor; and storage accessible to the at least one processor and comprising instructions executable by the at least one processor to: present an icon on the display so that the icon appears disposed within the real world at or proximate to a location of a second device associated with the icon, the second device being different from the first device and existing in the real world; identify a user of the first device as looking at the icon; receive audible input from the user; and based on the identification of the user as looking at the icon, issue a command to the second device to execute at least one function in conformance with the audible input.
2. The first device of claim 1, wherein proximate to the location of the second device comprises one or more of: within a threshold non-zero distance to the second device, above or next to the location of the second device at a same depth as the second device relative to the first device.
3. The first device of claim 1, wherein the instructions are executable to: receive the audible input from the user while the user is identified as looking at the icon; and based on the audible input being received while the user is identified as looking at the icon, issue the command to the second device.
4. The first device of claim 1, comprising at least one camera, and wherein the instructions are executable to: identify the user of the first device as looking at the icon based on input from the at least one camera.
5. The first device of claim 1, wherein the instructions are executable to: identify the user of the first device as looking at the icon for at least a threshold non-zero amount of time; and based on the identification of the user as looking at the icon for at least the threshold non-zero amount of time, issue the command to the second device.
6. The first device of claim 1, wherein the instructions are executable to: identify a selection of the icon by the user via user input other than the user looking at the icon; and based on the identification of the selection of the icon, present a first graphical user interface (GUI) on the display, the first GUI comprising one or more graphical elements other than the icon that are related to the second device.
7. The first device of claim 6, wherein the first GUI presented on the display comprises elements from a second GUI that the second device is configured to present on a display of the second device itself.
8. The first device of claim 6, wherein the first GUI presented on the display comprises a hybrid of respective elements from second and third GUIs, the second device being configured to present the second and third GUIs on a display of the second device itself, the second and third GUIs being different from each other.
9. The first device of claim 1, wherein the instructions are executable to: based on the identification of the user as looking at the icon and without receiving a wake up word from the user, issue the command to the second device.
10. The first device of claim 1, wherein the at least one function relates to operation of hardware of the second device.
11. A method, comprising: identifying a user of a first device as looking at one or more of: a second device different from the first device, a graphical element presented on a display of the first device and associated with the second device; identifying audible input from the user; and based on the identifying of the user as looking at one or more of the second device and the graphical element, transmitting an indication to the second device to execute at least one function in conformance with the audible input.
12. The method of claim 11, wherein the method comprises identifying the user as looking at the graphical element presented on the display of the first device, and wherein the graphical element is presented on the display so that the graphical element appears disposed within the real world at or proximate to a real world location of the second device.
13. The method of claim 12, wherein proximate to the real world location of the second device is within a threshold non-zero distance to the second device.
14. The method of claim 12, wherein proximate to the real world location of the second device is above or next to the location of the second device at a same real world depth as the second device relative to the first device.
15. The method of claim 11, wherein the graphical element is a first graphical element, wherein the audible input is first audible input, wherein the indication is a first indication, and wherein the method comprises: presenting a second graphical element on the display of the first device, the second graphical element being associated with a third device different from the first device and different from the second device, the second graphical element being different from the first graphical element, the second graphical element being concurrently presented on the display with the first graphical element; identifying the user of the first device as looking at the second graphical element; identifying second audible input from the user; and based on the identification of the user as looking at the second graphical element, transmitting a second indication to the third device to execute at least one function in conformance with the second audible input.
16. The method of claim 11, comprising: presenting the graphical element on the display using augmented reality software; and identifying the user as looking at the graphical element presented on the display of the first device.
17. The method of claim 11, comprising: presenting the graphical element on the display responsive to determining that the second device itself is not visible to the user relative to the current location of the user; and identifying the user as looking at the graphical element presented on the display of the first device.
18. At least one computer readable storage medium (CRSM) that is not a transitory signal, the computer readable storage medium comprising instructions executable by at least one processor to: identify a user of a first device as looking at one or more of: a second device different from the first device, a graphical element presented on a display of the first device and associated with the second device; identify audible input from the user; and based on the identification of the user as looking at one or more of the second device and the graphical element, transmit a notification to the second device to execute at least one function in conformance with the audible input.
19. The CRSM of claim 18, wherein the instructions are executable to: identify the user of the first device as looking at the graphical element presented on the display of the first device; and based on the identification of the user as looking at the graphical element, transmit the notification to the second device.
20. The CRSM of claim 19, wherein the graphical element is presented on the display of the first device to appear at or proximate to the real world location of the second device.